NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Sparse Autoencoders for Hypothesis Generation

Movva, Rajiv; Peng, Kenny; Garg, Nikhil; Kleinberg, Jon; Pierson, Emma (June 2025, International Conference on Machine Learning)

We describe HypotheSAEs, a general method to hypothesize interpretable relationships between text data (e.g., headlines) and a target variable (e.g., clicks). HypotheSAEs has three steps: (1) train a sparse autoencoder on text embeddings to produce interpretable features describing the data distribution, (2) select features that predict the target variable, and (3) generate a natural language interpretation of each feature (e.g., mentions being surprised or shocked) using an LLM. Each interpretation serves as a hypothesis about what predicts the target variable. Compared to baselines, our method better identifies reference hypotheses on synthetic datasets (at least +0.06 in F1) and produces more predictive hypotheses on real datasets (~twice as many significant findings), despite requiring 1-2 orders of magnitude less compute than recent LLM-based methods. HypotheSAEs also produces novel discoveries on two well-studied tasks: explaining partisan differences in Congressional speeches and identifying drivers of engagement with online headlines.
more » « less
Full Text Available
Sparse Autoencoders for Hypothesis Generation

Movva, Rajiv; Peng, Kenny; Garg, Nikhil; Kleinberg, Jon; Pierson, Emma (May 2025, ICML)

Full Text Available
Learning Disease Progression Models That Capture Health Disparities

Chiang, Erica; Shanmugam, Divya; Beecy, Ashley_N; Sayer, Gabriel; Estrin, Deborah; Garg, Nikhil; Pierson, Emma (May 2025, Conference on Health, Inference, and Learning)

Full Text Available
Quantifying disparities in intimate partner violence: a machine learning method to correct for underreporting

https://doi.org/10.1038/s44294-024-00011-5

Shanmugam, Divya; Hou, Kaihua; Pierson, Emma (December 2024, npj Women's Health)

Abstract The first step towards reducing the pervasive disparities in women’s health is to quantify them. Accurate estimates of therelative prevalenceacross groups—capturing, for example, that a condition affects Black women more frequently than white women—facilitate effective and equitable health policy that prioritizes groups who are disproportionately affected by a condition. However, it is difficult to estimate relative prevalence when a health condition is underreported, as many women’s health conditions are. In this work, we present , a method for accurately estimating the relative prevalence of underreported health conditions which builds upon the literature in positive unlabeled learning. We show that under a commonly made assumption—that the probability of having a health condition given a set of symptoms remains constant across groups—we can recover the relative prevalence, even without restrictive assumptions commonly made in positive unlabeled learning and even if it is impossible to recover the absolute prevalence. We conduct experiments on synthetic and real health data which demonstrate ’s ability to recover the relative prevalence more accurately than do previous methods. We then use to quantify the relative prevalence of intimate partner violence (IPV) in two large emergency department datasets. We find higher prevalences of IPV among patients who are on Medicaid, not legally married, and non-white, and among patients who live in lower-income zip codes or in metropolitan counties. We show that correcting for underreporting is important to accurately quantify these disparities and that failing to do so yields less plausible estimates. Our method is broadly applicable to underreported conditions in women’s health, as well as to gender biases beyond healthcare.
more » « less
Full Text Available
Generative Artificial Intelligence in Medicine

https://doi.org/10.1146/annurev-biodatasci-103123-095332

Shanmugam, Divya; Agrawal, Monica; Movva, Rajiv; Chen, Irene Y; Ghassemi, Marzyeh; Jacobs, Maia; Pierson, Emma (March 2025, Annual Review of Biomedical Data Science)

The increased capabilities of generative artificial intelligence (AI) have dramatically expanded its possible use cases in medicine. We provide a comprehensive overview of generative AI use cases for clinicians, patients, clinical trial organizers, researchers, and trainees. We then discuss the many challenges—including maintaining privacy and security, improving transparency and interpretability, upholding equity, and rigorously evaluating models—that must be overcome to realize this potential, as well as the open research directions they give rise to.
more » « less
Full Text Available
Race adjustments in clinical algorithms can help correct for racial disparities in data quality

https://doi.org/10.1073/pnas.2402267121

Zink, Anna; Obermeyer, Ziad; Pierson, Emma (August 2024, Proceedings of the National Academy of Sciences)

Despite ethical and historical arguments for removing race from clinical algorithms, the consequences of removal remain unclear. Here, we highlight a largely undiscussed consideration in this debate: varying data quality of input features across race groups. For example, family history of cancer is an essential predictor in cancer risk prediction algorithms but is less reliably documented for Black participants and may therefore be less predictive of cancer outcomes. Using data from the Southern Community Cohort Study, we assessed whether race adjustments could allow risk prediction models to capture varying data quality by race, focusing on colorectal cancer risk prediction. We analyzed 77,836 adults with no history of colorectal cancer at baseline. The predictive value of self-reported family history was greater for White participants than for Black participants. We compared two cancer risk prediction algorithms—a race-blind algorithm which included standard colorectal cancer risk factors but not race, and a race-adjusted algorithm which additionally included race. Relative to the race-blind algorithm, the race-adjusted algorithm improved predictive performance, as measured by goodness of fit in a likelihood ratio test (P-value: <0.001) and area under the receiving operating characteristic curve among Black participants (P-value: 0.006). Because the race-blind algorithm underpredicted risk for Black participants, the race-adjusted algorithm increased the fraction of Black participants among the predicted high-risk group, potentially increasing access to screening. More broadly, this study shows that race adjustments may be beneficial when the data quality of key predictors in clinical algorithms differs by race group.
more » « less
Full Text Available
Advancing science- and evidence-based AI policy

https://doi.org/10.1126/science.adu8449

Bommasani, Rishi; Arora, Sanjeev; Chayes, Jennifer; Choi, Yejin; Cuéllar, Mariano-Florentino; Fei-Fei, Li; Ho, Daniel E; Jurafsky, Dan; Koyejo, Sanmi; Lakkaraju, Hima; et al (July 2025, Science)

Policy must be informed by, but also facilitate the generation of, scientific evidence
more » « less
Full Text Available
MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning

Li, Shuyue_Stella; Balachandran, Vidhisha; Feng, Shangbin; Ilgen, Jonathan_S; Pierson, Emma; Koh, Pang_Wei; Tsvetkov, Yulia (December 2024, NeurIPS)

Full Text Available
MEDIQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning

Li, Shuyue_Stella; Balachandran, Vidhisha; Feng; Feng, Shangbin; Ilgen, Jonathan_S; Pierson, Emma; Koh, Pang_Wei; Tsvetkov, Yulia (December 2024, NeurIPS)

Full Text Available
Domain constraints improve risk prediction when outcome data is missing

Balachandar, Sidhika; Garg, Nikhil; Pierson, Emma (April 2024, ICLR)

Full Text Available

« Prev Next »

Search for: All records